credit quality
Uncovering the Source of Machine Bias
Hu, Xiyang, Huang, Yan, Li, Beibei, Lu, Tian
We develop a structural econometric model to capture the decision dynamics of human evaluators on an online micro-lending platform, and estimate the model parameters using a real-world dataset. We find two types of biases in gender, preference-based bias and belief-based bias, are present in human evaluators' decisions. Both types of biases are in favor of female applicants. Through counterfactual simulations, we quantify the effect of gender bias on loan granting outcomes and the welfare of the company and the borrowers. Our results imply that both the existence of the preference-based bias and that of the belief-based bias reduce the company's profits. When the preference-based bias is removed, the company earns more profits. When the belief-based bias is removed, the company's profits also increase. Both increases result from raising the approval probability for borrowers, especially male borrowers, who eventually pay back loans. For borrowers, the elimination of either bias decreases the gender gap of the true positive rates in the credit risk evaluation. We also train machine learning algorithms on both the real-world data and the data from the counterfactual simulations. We compare the decisions made by those algorithms to see how evaluators' biases are inherited by the algorithms and reflected in machine-based decisions. We find that machine learning algorithms can mitigate both the preference-based bias and the belief-based bias.
Designing Inherently Interpretable Machine Learning Models
Interpretable machine learning (IML) becomes increasingly important in highly regulated industry sectors related to the health and safety or fundamental rights of human beings. In general, the inherently IML models should be adopted because of their transparency and explainability, while black-box models with model-agnostic explainability can be more difficult to defend under regulatory scrutiny. For assessing inherent interpretability of a machine learning model, we propose a qualitative template based on feature effects and model architecture constraints. It provides the design principles for high-performance IML model development, with examples given by reviewing our recent works on ExNN, GAMI-Net, SIMTree, and the Aletheia toolkit for local linear interpretability of deep ReLU networks. We further demonstrate how to design an interpretable ReLU DNN model with evaluation of conceptual soundness for a real case study of predicting credit default in home lending. We hope that this work will provide a practical guide of developing inherently IML models in high risk applications in banking industry, as well as other sectors.
How to measure fairness when an algorithm decides
Companies and governments delegating or supporting decisions in machine learning algorithms provoke concern and even opposition. This is because high-stakes decisions are being automated and there is evidence that algorithms can replicate or amplify existing biases. The problem is that these issues are not fully resolved even for when decisions are made by people, so there are no general criteria that can be clearly transferred to an algorithm. For example, when it comes to promoting gender fairness in recruitment, should men and women have the same opportunity, and should competences determine who gets the position? Or should you fill a vacancy to maintain parity or a quota, even if it involves ignoring more capable candidates? Issues like these always arise when trying to ensure fairness, or avoid discrimination, in any aspect of the human condition where there are illegitimate differences or when there are vulnerable groups.